High throughput energy efficient multi - FFT architecture on FPGAs ( Draft ) ∗
نویسنده
چکیده
To process high-rate streaming data, throughput is one of the key performance metrics for FFT design. However, high throughput FFT architectures consume large amount of power due to complex routing or excessive memory access. In this paper, we propose a Cooley-Turkey algorithm based, high throughput energy-efficient multi-FFT architecture. In the proposed architecture, we use multiple time-multiplexed pipeline FFT processors to achieve high throughput. Time-multiplexers instead of routing network are used for shuffling the intermediate data, thus reducing the burden of interconnection power for large FFT problem size. To reduce memory power, a method of dynamic memory activation is developed. For N -point FFT (64 ≤ N ≤ 4096), our designs improve the energy efficiency (defined as GOPS/Joule) by 17% to 26%, compared with a stateof-the-art design. The experimental results show that, for various throughput requirements, the proposed design can achieve 50 ∼ 63 GOPS/Joule, i.e. up to 78% of the estimated peak energy efficiency of FFT designs on FPGAs. Index Terms — Fast Fourier Transform, FPGA computing, High throughput FFT, Energy efficient design
منابع مشابه
Pipeline FFT Architectures Optimized for FPGAs
This paper presents optimized implementations of two different pipeline FFT processors on Xilinx Spartan-3 and Virtex-4 FPGAs. Different optimization techniques and rounding schemes were explored. The implementation results achieved better performance with lower resource usage than prior art. The 16-bit 1024-point FFT with the R22SDF architecture had amaximum clock frequency of 95.2 MHz and use...
متن کاملDesign of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems
Hybrid Wireless Network-on-Chip (WNoC) architecture is emerged as a scalable communication structure to mitigate the deficits of traditional NOC architecture for the future Multi-core systems. The hybrid WNoC architecture provides energy efficient, high data rate and flexible communications for NoC architectures. In these architectures, each wireless router is shared by a set of processing core...
متن کاملEnergy-efficient and parameterized designs for fast Fourier transform on FPGAs
In this paper, we develop energy efficient designs for the Fast Fourier Transform (FFT) on FPGAs. Architectures for FFT on FPGAs are designed by investigating and applying techniques for minimizing the energy dissipation. Architectural parameters such as degrees of vertical and horizontal parallelism are identified and a design domain is created through a combination of design choices. We deter...
متن کاملA Survey on Fft/ifft Processor for High Speed Wireless Communication System
The demand for increased channel capacity in mobile and wireless communication has been rapidly increasing due to multi fold increase in demands of multimedia services and mobile data. In the present scenario, high data rate are offered by WLAN, WiMax and LTE/ LTE-Advanced (LTE-A). Developing a wireless system for more spectral efficiency under varying channel condition is a key challenge to pr...
متن کاملSynergy: A HW/SW Framework for High Throughput CNNs on Embedded Heterogeneous SoC
Convolutional Neural Networks (CNN) have been widely deployed in diverse application domains. There has been significant progress in accelerating both their training and inference using high-performance GPUs, FPGAs, and custom ASICs for datacenter-scale environments. The recent proliferation of mobile and IoT devices have necessitated real-time, energy-efficient deep neural network inference on...
متن کامل